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Abstract. We use an interesting result of probabilistic flavor concerning the 
product of two permutations consisting of one cycle each to find an explicit 
formula for the average number of block interchanges needed to sort a permu- 
tation of length n. 



1. Introduction 

1.1. The main definition, and the outline of this paper. Let p = p\Pi • • 'Pn 
be a permutation. A block interchange is an operation that interchanges two blocks 
of consecutive entries without changing the order of entries within each block. The 
two blocks do not need to be adjacent. Interchanging the blocks piPi+i ■ ■ ■ Pi+ a and 
Pj'Pi+i ' ' 'Pj+b with i + a < pj results in the permutation 

PlP2 ■ ■ -Pi-lPjPj+l ■ ■ -Pj+bPi+a+l ■ ■ -Pj-lPiPi+l ■ ■ -Pi+aPj+b+l ■ ■ 'Pn- 

For instance, if p = 3417562, then interchanging the block of the first two entries 
with the block of the last three entries results in the permutation 5621734. 

In this paper, we are going to compute the average number of block interchanges 
to sort a permutation of length n. The methods used in the proof are surprising 
for several reasons. First, our enumeration problem will lead us to an interesting 
question on the symmetric group that is very easy to ask and that is of probabilistic 
flavor. Second, this question then turns out to be surprisingly difficult to answer- 
the conjectured answer of one of the authors has only recently been proved by 
Richard Stanley [6] , whose proof was not elementary. 

1.2. Earlier Results and Further Definitions. The first significant result on 
the topic of sorting by block interchanges is by D. A. Christie [3], who provided a 
direct way of determining the number of block interchanges necessary to sort any 
given permutation p. The following definition was crucial to his results. 

Definition 1. The cycle graph G{p) of the permutation p = P1P2 ■ ■ ■ p n is a directed 
graph on vertex set {0, 1, • • • , n} and 2n edges that are colored either black or gray 
as follows. Set po = 0. 

(1) For < i < n, there is a black edge from pt to Pi-\, where the indices are 
to be read modulo n + 1, and 

(2) For < i < n, there is a gray edge from i to i + 1, where the indices are to 
be read modulo n + 1. 
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FIGURE 1. The graphs G(p) for p = 1234 and p = 4213. 

It is straightforward to show that G(p) has a unique decomposition into edge- 
disjoint directed cycles in which the colors of the edges alternate. Let c(G(p)) be the 
number of directed cycles in this decomposition of G(p). The main enumerative 
result of [3] is the following formula. In the rest of this paper, permutations of 
length n will be called n-permutations, for shortness. 

Theorem 1. The number of block interchanges needed to sort the n-permutation 
pis ^-f^)) . 

Note that in particular this implies that n+ 1 and c(G(p)) are always of the same 
parity. Christie has also provided an algorithm that sorts p using n+1 ~ c ^. G (p)) block 
interchanges. As the identity permutation is the only n-permutation that takes zero 
block interchanges to sort, it is the only n-permutation p satisfying c(G(p)) = n + 1. 

Theorem Q] shows that in order to find the average number a n of block inter- 
changes needed to sort an n-permutation, we will need the average value of c(G(p)) 
for such permutations. The following definition [4] will be useful. 

Definition 2. The Hultman number tSjr(n, /c) is the number of n-permutations p 
satisfying c(G(p)) = k. 

So the Hultman numbers are somewhat analogous to the signless Stirling num- 
bers of the first kind that count n-permutations with k cycles. 

This is a good place to point out that in this paper, we will sometimes discuss 
cycles of the permutation p in the traditional sense, which are not to be confused 
with the directed cycles of G{p), counted by c(G(p)). Following [4], the number 
of cycles of the permutation p will be denoted by c(r(s)). Indeed, the cycles of a 
permutation p are equivalent to the directed cycles of the graph T(p) in which there 
is an edge from i to j if p(i) — j. For instance, if p = 1234, then c(T(p)) = 4, while 
c(G(p)) =5. 

The following recent theorem of Doignon and Labarre [4] brings the Hultman 
numbers closer to the topic of enumerating permutations according to their cycle 
structure (in the traditional sense). Let S n denote the symmetric group of degree 
n. 

Theorem 2. The Hultman number Sn{n, k) is equal to the number of ways to 
obtain the cycle (12.. ■ ■ -n(n + 1)) G SVi+i as a product qr of permutations, where 
q G Sn+i is any cycle of length n+1, and the permutation r G S n +\ has exactly k 
cycles, that is c(r(r)) = k. 
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2. Our Main Result 

The following immediate consequence of Theorem [2] is more suitable for our 
purposes. 

Corollary 1. The Hultman number Sn{n,k) is equal to the number of (n + 1)- 
cycles q so that the product (12 • • • n(n + l))q is a permutation with exactly k cycles, 
that is, c(r((12 • • • n(n + l))q) = k. 

Example 1. For any fixed n, we have <Sjj(n,n+l) = 1 since c(G(p)) = n + 1 if and 
only if p is the identity permutation. And indeed, there is exactly one (n + l)-cycle 
(in fact, one permutation) q £ S n +\ so that (12.. • • ■ n(n + l))q has n + 1 cycles, 
namely q = (12 • ■■n(n + 1)) _1 = (l(n + l)n- • -2). 

In other words, finding the average of the numbers c(G(p)) over all n-permutations 
p is equivalent to finding the average of the numbers c(r((12.. • • ■ n(n + l))q), where 
q is an (n + l)-cycle. 

Let us consider the product s — (12 • • • n)z, where z is a cycle of length n. Let 
us insert the entry n + 1 into z to get the permutation z' so that n + 1 is inserted 
between two specific entries a and b in the following sense. 

)z(i) if i £ {a, n + 1}, 
n + 1 if i = a, and 
b if i = n + 1. 

See Figure [2] for an illustration. 




Figure 2. How z' is obtained from z. 

The following proposition is the first step towards describing how the Hultman 
numbers grow. 

Proposition 1. Let a, b, and z' be defined as above, and let s' = (12 • • • (n + l)z' . 
Then we have 

)c(r(s)) — 1 if 2 < a, and a — 1 and z(l) are not in the same cycle of s, 
c(r(s)) + 1 if2<a, and a — 1 and z(l) are in the same cycle of s, and 
c(r(s)) + l ifa = l. 
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Proof. Let us assume first that a > 2, and that a — 1 is in a cycle C\ of s, and z(l) is 
in a different cycle C2 of s. Let C\ = ((a — 1)6 • • • ) and let C2 = (z(l) ■ ■ ■ n). After 
the insertion of n + 1 into 2, the newly obtained permutation s' = (12.. • • • (n+ l)z' 
sends a — 1 to n+ 1, then n + 1 to z(l), then leaves the rest of C2 unchanged till 
its last entry. Then it sends n back to z'(n + 1) = b, from where it continues with 
the rest of C± with no change. So in s', the cycles C\ and C'2 are united, the entry 
n + 1 joins their union, and there is no change to the other cycles of s. See Figure 
[5] for an illustration. 



a-1 




n+1 



Figure 3. If a — 1 and z(l) are in different cycles of s, those cycles 
will turn into one. 



Let us now assume that a > 2, and that a — 1 and z(l) are both in the same 
cycle C of s. Then C = ((a — 1)6 • • • n(z(l)) ■ ■ ■ ). After the insertion of n + 1 into 
z, the newly obtained permutation s' = (12. . ■ ■ ■ (n + l)z' sends a - 1 to n+1, then 
n + 1 to z(l), cutting off the part of C that was between a — 1 and n. So C is split 
into two cycles, the cycle C = ((a — l)(n + 1)^(1) • ■ ■ ) and the cycle C" — (6 • ■ • n). 
Note that s'(n) = b since z'(n+ 1) = 6. See Figure [2] for an illustration. 

Finally, if a = 1, then s'(n + 1) = (n + 1), and the rest of the cycles of s do not 
change. □ 

Let TC n denote the set of n-permutations that can be obtained as a product of 
two cycles of length n. Let a n be the average number of cycles of the elements of 

TC n . 

Lemma 1. For all positive integers m, we have a2 m +2 = a 2m+i + 2m+i ' 

Proof. We apply Proposition [TJ with n replaced by n + 1 = 2m + 1. This means 
n + 1 is an odd number. So z' is a cycle of length 2m + 2 obtained from a cycle 
z of length 2m + 1 through the insertion of the maximal element 2m + 2 into one 
of 2m + 1 possible positions. By Proposition [TJ some of these insertions increase 
c(G(s)) by one, and others decrease c(G(s)) by one, depending on whether a — 1 
and z(l) are in the same cycle of s or not. The question is, of course, how many 
times they will be in the same cycle of s. 
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a-1 



z(l) 



n+1 



Figure 4. If a — 1 and z(l) are in the same cycle of s, that cycle 
will split into two cycles. 



This question is easily seen to be equivalent to the following question. 

Question 1. Let i and j be two fixed elements of the set {1, 2, • • • , h} Select an 
element p ofTCh at random. What is the probability that p contains i and j in the 
same cycle? 

The first author of this article has conjectured that the answer to this question 
was 1/2 for odd h. This conjecture was recently proved by Richard Stanley [6], 
who also settled the question for even values of h. 

Theorem 3. [S] Let i and j be two fixed, distinct elements of the set [h] = 
{1,2,- •• , h}, where h > 1. Let x and y be two randomly selected h-cycles over 
[h]. Let p(h) be the probability that i and j are in the same cycle of xy. Then 



p(h) 



i if h is odd, and 

1 2 



2 (h-lKH-2) */ k lS eVeU - 

The proof of Lemma [T] is now straightforward. If a ^ 1, then a — 1 and z(l) 
are equally likely to be in the same cycle or not in the same cycle of s. Therefore, 
an increase of one or a decrease of one in c(G(s)) is equally likely. If, on the other 
hand, a = 1, which occurs in l/(2m+ 1) of all cases, then c(G(s)) increases by one. 
So 

2m 1 1 

a2m+2 ~ 2^TT fl2m+1 + 2^TT (a2m+1 + 1] ~ a2m+1 + 2^TT 

□ 

Note that the statement of Lemma [T] holds even when m = 0, since 02 = 2 = 
ax + 1. 

As Theorem [3] provided a formula for P(h) for even values of h as well as odd 
values, we can state and prove the analogous version of Lemma [T] for the integers 
not covered there. 
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Lemma 2. For all positive integers m, we have 

mil 

a2m+i — a 2 , n + —, —-, —7 — a 2m + -z ; — T- 

2(m + l)(m + 2) 2m m(m + 1J 

Proof. This is very similar to the proof of Lemma [1] If a ^ 1 , which happens in 
(2m — 1) /(2m) of all cases, then the probability of a — 1 and z(l) falling into the 
same cycle of s is \ — ( 2 m+i)(2m+4) by Theorem [3] By Proposition [TJ in these cases 
c(G(s)) grows by one. If a = 1, which occurs in l/(2m + 2) of all cases, c(G(s)) 
always grows by one. So 

2m - 1 / 1 2 \ 

a2m+i — — = ' o ~ 77, TTTo ToT (° 2m + 

2m \2 (2m — lj(2m + 2) / 



2m- 1 /l 2 



2m V 2 (2m-l)(2m + 2) 

— (a 2m + 1), 
Im 



(a2m - 1) 



which is equivalent to the statement of the lemma as can be seen after routine 
rearrangements. □ 

We are now in position to state and prove our formula for the average number 
a n of cycles in elements of TC n . 

Theorem 4. We have a\ = 1, and 

n-l 



I ^1 

0/21 +1 



L(n-1)/2J+. 

Note that this formula produces even the correct value even for n = 1, that is, 
it produces the equality a\ = 1. 

Proof, (of Theorem^]) The statement is now a direct consequence of Lemmas [T] and 
[2] if we note the telescoping sum Xh=i Wpp\\ = ^ ~ tTI obtained when summing 
the values computed in Lemma [5] □ 

Note that it is well-known that on average, an n-permutation has Yj2=i ' c y _ 
cles. This is the average value of c(T(p)) for a randomly selected ro-permutation. 
Theorem |4] shows that the average value of c(G(p)) differs from this by about l/n. 

Finally, our main goal is easy to achieve. 

Theorem 5. The average number of block interchanges needed to sort an n-permutation 
is 



n - 



|n/2j+l A^i=2 



En X 
i=2 i 



Proof. By Theorem[2]and TheoremHl the average number value of c(G(p)) over all 
permutations p of length n is a n +i = v^wv+i ~^'Y^i=i ?■ Our claim now immediately 
follows from Theorem [T] □ 

So the average number of block interchanges needed to sort an n-permutation is 
close to (n — logn)/2. 
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3. Remarks and Further Directions 

Richard Stanley's proof of Theorem [3] is not elementary. It uses symmetric 
functions, exponential generating functions, integrals, and a formula of Boccara pQ. 
A more combinatorial proof of the stunningly simple answer for the case of odd k 
would still be interesting. 

As pointed out by Richard Stanley [7] , there is an alternative way to obtain the 
result of Theorem|4] without using Thcorcm[3l but that proof in turn uses symmetric 
functions and related machinery. It is shown in Exercises 69(a) and 69(c) of [8] that 

, L(»-i)/2j 

(1) P n (q) = E 1 C(riP)) = Al+TV E c(n+l,n-2 l ) q ^\ 

p€TC n \ 2 I i=0 

where, as usual, c(n, k) is a signless Stirling number of the first kind, that is, the 
number of permutations of length n with k cycles. Now a n can be computed by 
considering P ? '(l), which in turn can be computed by considering the well-known 
identity 

n+1 

F n+1 (x) = E c{n + l,k)x k =x(x + !)■■■ (x + n), 

k=\ 

and then evaluating F/ l+1 (l) + i 7 n+1 (— 1). 

The present paper provides further evidence that the cycles of the graph G{p) 
have various enumerative properties that are similar to the enumerative properties 
of the graph T(p), that is, the cycles of the permutation p. This raises the question 
as to which well-known properties of the Stirling numbers, such as unimodality, 
log-concavity, real zeros property, hold for the Hultman numbers as well. (See for 
instance Chapter 8 of [2] for definitions and basic information on these properties.) 
A simple modification is necessary since <Sfj(n, k) — if n and k are of the same 
parity. So let 

J2 P £TC <? c ( r ^/ 2 if n is even 
EpgtcI g (c(r(p))+1)/2 if n is odd. 

While the coefficients of P n (q) are all the Hultman numbers Sn{n — 1,1), 
Sn(n— 1, 2), ■ • • , Sh(ti — 1, n— 1), the coefficients of Q n (q) are the nonzero Hultman 
numbers Sn{n — 1, k). 

Clearly, Q n (q) — P n {q 2 ) if n is even, and Q n (q) = qP n (q 2 ) if n is odd. However, 
Exercise 69(b) of [8] shows that all roots of P n (q) have real part 0. Hence the roots 
of Q n {q) are all real and non-positive, from which the log-concavity and unimodality 
of the coefficients of Q n {q) follows. This raises the question of whether there is a 
combinatorial proof for the latter properties, possibly along the lines of the work 
of Bruce Sagan [5] for the Stirling numbers of both kinds. Perhaps it is useful to 
note that (fT]) and Theorem [2] imply that 

f c(n + 2,k)/( n t 2 ) if n-k is odd, 
S H (n,k) = < \ ' ". v 2 i 
y if n — k is even. 

Finally, to generalize in another direction, we point out that it is very well-known 
(see, for example, Chapter 4 of [2]), that if we select a n-permutation p at random, 
and i and j are two fixed, distinct positive integers at most as large as n, then the 
probability that p contains i and j in the same cycle is 1/2. Theorem |3] shows that 
if n is odd, then the set TC n behaves just like the set S n of all permutations in this 



Qn(q) 
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aspect. This raises the question whether there are other naturally defined subsets 
of n-permutations in which this phenomenon occurs. 
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